PAT-tree-based Language Modeling with Initial Application of Chinese Speech Recognition Output Verification

نویسندگان

Chun-Liang Chen

Bo-Ren Bai

Lee-Feng Chien

Lin-Shan Lee

چکیده

In spontaneous speech recognition, there are always inevitable errors in the output due to the difficulties of acoustic recognition or linguistic decoding. In this paper, we present an output verification approach to detect and correct the errors automatically using the abundant Internet resources. The Syllable PAT tree (SPAT tree), a metamorphic data structure derived from the PAT tree concept, is a real N-gram language model and is first used as a verifier for speech recognition output in order to improve the accuracy of speech recognition. The verification approaches proposed here not only reduce the character error rate by 12.66% in preliminary experiments, but can make the recognition results more reliable for the following-up processing, such as semantic analysis in dialog control or speech understanding.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved context-dependent acoustic modeling for continuous Chinese speech recognition

This paper describes the new framework of context-dependent (CD) Initial/Final (IF) acoustic modeling using the decision tree based state tying for continuous Chinese speech recognition. The Extended Initial/Final (XIF) set is chosen as the basic speech recognition unit (SRU) set according to the Chinese language characteristics, which outperforms the standard IF set. An adaptive mixture increa...

متن کامل

English Alphabet Recognition Based on Chinese Acoustic Modeling

How to effectively recognize English letters spoken by Chinese people is our major concern in the paper. Some efforts are made to build Chinese extended Initial/Final (XIF) based HMMs for English alphabet recognition which can be integrated with large vocabulary continuous Chinese speech recognition (Chinese LVCSR) system based on a same XIF set. The alphabet-specific XIF HMMs are built using c...

متن کامل

Internet Chinese information retrieval using unconstrained Mandarin speech queries based on a client-server architecture and a PAT-tree-based language model

In order to pursue high performance of Chinese information access on the Internet, this paper presents an attractive approach with a successful integration of efficient speech recognition and information retrieval techniques. A working system based on the proposed approach for speech retrieval of real-time Chinese netnews services has been implemented and tested. Very exciting performance has b...

متن کامل

Acoustic modeling and language modeling for cantonese LVCSR

This paper describes our recent work on the development of a large-vocabulary, speaker-independent continuous speech recognition system for Cantonese (a major Chinese dialect). Both acoustic modeling and language modeling are being addressed. For acoustic modeling, we focus on right-context-dependent sub-syllable units. Tying of HMM at model as well as state level is applied based on phonetic k...

متن کامل

Mandarin Pronunciation Modeling Based on Cass Corpus1

The pronunciation variability is an important issue that must be faced with when developing practical automatic spontaneous speech recognition systems. In this paper, the factors that may affect the recognition performance are analyzed, including those specific to the Chinese language. By studying the INITIAL/FINAL (IF) characteristics of Chinese language and developing the Bayesian equation, w...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

PAT-tree-based Language Modeling with Initial Application of Chinese Speech Recognition Output Verification

نویسندگان

چکیده

منابع مشابه

Improved context-dependent acoustic modeling for continuous Chinese speech recognition

English Alphabet Recognition Based on Chinese Acoustic Modeling

Internet Chinese information retrieval using unconstrained Mandarin speech queries based on a client-server architecture and a PAT-tree-based language model

Acoustic modeling and language modeling for cantonese LVCSR

Mandarin Pronunciation Modeling Based on Cass Corpus1

عنوان ژورنال:

اشتراک گذاری